Function Words in Authorship Attribution. From Black Magic to Theory?
نویسنده
چکیده
This position paper focuses on the use of function words in computational authorship attribution. Although recently there have been multiple successful applications of authorship attribution, the field is not particularly good at the explication of methods and theoretical issues, which might eventually compromise the acceptance of new research results in the traditional humanities community. I wish to partially help remedy this lack of explication and theory, by contributing a theoretical discussion on the use of function words in stylometry. I will concisely survey the attractiveness of function words in stylometry and relate them to the use of character n-grams. At the end of this paper, I will propose to replace the term ‘function word’ by the term ‘functor’ in stylometry, due to multiple theoretical considerations.
منابع مشابه
Function Words for Chinese Authorship Attribution
This study explores the use of function words for authorship attribution in modern Chinese (C-FWAA). This study consists of three tasks: (1) examine the C-FWAA effectiveness in three genres: novel, essay, and blog; (2) compare the strength of function words as both genre and authorship indicators, and explore the genre interference on C-FWAA; (3) examine whether C-FWAA is sensitive to the time ...
متن کاملTowards a better understanding of Burrows's Delta in literary authorship attribution
Burrows’s Delta is the most established measure for stylometric difference in literary authorship attribution. Several improvements on the original Delta have been proposed. However, a recent empirical study showed that none of the proposed variants constitute a major improvement in terms of authorship attribution performance. With this paper, we try to improve our understanding of how and why ...
متن کاملMore than Word Frequencies: Authorship Attribution via Natural Frequency Zoned Word Distribution Analysis
With such increasing popularity and availability of digital text data, authorships of digital texts can not be taken for granted due to the ease of copying and parsing. This paper presents a new text style analysis called natural frequency zoned word distribution analysis (NFZ-WDA), and then a basic authorship attribution scheme and an open authorship attribution scheme for digital texts based ...
متن کاملEffects of Subliminal Priming of Self and God on Self-Attribution of Authorship for Events
Three studies investigated how subliminally primed thoughts of an agent prior to action can aVect ascriptions of authorship for that action. Participants competed against a computer program to remove words from a computer screen. Participants reported greater feelings of authorship when primed with Wrst person singular pronouns, and lower feelings of authorship when primed with “computer.” We a...
متن کاملEVects of subliminal priming of self and God on self-attribution of authorship for events
Three studies investigated how subliminally primed thoughts of an agent prior to action can aVect ascriptions of authorship for that action. Participants competed against a computer program to remove words from a computer screen. Participants reported greater feelings of authorship when primed with Wrst person singular pronouns, and lower feelings of authorship when primed with “computer.” We a...
متن کامل